Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method

نویسندگان

  • Yutong Shao
  • Rico Sennrich
  • Bonnie L. Webber
  • Federico Fancellu
چکیده

Idiom translation is a challenging problem in machine translation because the meaning of idioms is non-compositional, and a literal translation is likely to be wrong. In this paper, we assess the quality of idiom translation of a modern neural MT system. We introduce a new evaluation method based on an idiom-specific blacklist of literal translations, based on the insight that the occurrence of any blacklisted words in the translation output indicates a likely translation error. We introduce a dataset, CIBB (Chinese Idioms Blacklists Bank), and perform an evaluation of a state-of-the-art Chinese-English neural MT system. Our evaluation confirms that our blacklist method is effective at identifying literal translation errors, and that a sizable number of idioms in our test set are mistranslated (36.5%).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of a Substitution Method for Idiom Transformation in Statistical Machine Translation

We evaluate a substitution based technique for improving Statistical Machine Translation performance on idiomatic multiword expressions. The method operates by performing substitution on the original idiom with its literal meaning before translation, with a second substitution step replacing literal meanings with idioms following translation. We detail our approach, outline our implementation a...

متن کامل

IJCNLP 2008 Sixth SIGHAN Workshop on Chinese Language Processing

In this paper, we propose an example-based decoder for a statistical machine translation (SMT) system, which is used for spoken language machine translation. In this way, it will help to solve the re-ordering problem and other problems for spoken language MT, such as lots of omissions, idioms etc. Through experiments, we show that this approach obtains improvements over the baseline on a Chines...

متن کامل

On Integrated Translation Approach of English Idioms

Idioms are the gems of a language. Translating idioms is not an easy task because not all English idioms have equivalents in Chinese idioms due to linguistic, especially cultural differences. Research indicates that there are six translation tactics in translating English idioms into Chinese: literal translation, free translation, compensatory translation, explanational translation, borrowing, ...

متن کامل

Strategies Employed in Translation of Idioms in English Subtitles of Two Persian Television Series

Translation of idioms seems to be complicated for most translators since the meaning of idioms is difficult and sometimes impossible to be deduced from the meaning of their individual components. Considering the difficulties of translation of idioms and also the specific constraints of subtitling such as space and time limits, this research studied the strategies employed in translation of idio...

متن کامل

On the Effects of Linguistic, Verbal, and Visual Mnemonics on Idioms Learning

Finding more effective ways of teaching second language idioms has been a long standing concern of many teaching practitioners and researchers. This study was an endeavorto explore the effects of three linguistic mnemonic devices (etymological elaboration, keyword method, and translation) on EFL learners’ recognition and recall of English idioms. To achieve the purpose of the study, ninety male...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1711.07646  شماره 

صفحات  -

تاریخ انتشار 2017